The Best 1357 Text Embedding Tools in 2025

Jina Embeddings V3
Jina Embeddings V3 is a multilingual sentence embedding model supporting over 100 languages, specializing in sentence similarity and feature extraction tasks.
Text Embedding Transformers Supports Multiple Languages
jinaai
3.7M
911
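As a rough illustration of how a dense multilingual sentence-embedding model like this is typically used, here is a minimal sketch with the sentence-transformers library; the model id jinaai/jina-embeddings-v3 and the need for trust_remote_code are assumptions based on this listing, not a verified recipe.

```python
# Minimal sketch: encode multilingual sentences and compare them by cosine similarity.
# Model id and trust_remote_code flag are assumptions based on the listing above.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("jinaai/jina-embeddings-v3", trust_remote_code=True)

sentences = ["How is the weather today?", "Wie ist das Wetter heute?"]
embeddings = model.encode(sentences)                      # one vector per sentence
print(util.cos_sim(embeddings[0], embeddings[1]))         # cross-lingual similarity score
```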
Ms Marco MiniLM L6 V2
Apache-2.0
A cross-encoder model trained on the MS Marco passage ranking task for query-passage relevance scoring in information retrieval
Text Embedding English
cross-encoder
2.5M
86
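Unlike a bi-encoder, a cross-encoder scores each query-passage pair jointly instead of embedding them separately. A minimal sketch with sentence-transformers follows; the model id cross-encoder/ms-marco-MiniLM-L-6-v2 is assumed from this listing.

```python
# Minimal sketch: rerank candidate passages for a query with a cross-encoder.
# The model id is an assumption based on the listing above.
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

query = "How many people live in Berlin?"
passages = [
    "Berlin has around 3.7 million registered inhabitants.",
    "Berlin is well known for its museums.",
]
scores = reranker.predict([(query, p) for p in passages])  # higher score = more relevant
print(scores)
```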
Opensearch Neural Sparse Encoding Doc V2 Distill
Apache-2.0
A sparse retrieval model trained with knowledge distillation and optimized for OpenSearch, supporting inference-free document encoding with improved search relevance and efficiency over V1.
Text Embedding Transformers English
opensearch-project
1.8M
7
Sapbert From PubMedBERT Fulltext
Apache-2.0
A biomedical entity representation model based on PubMedBERT, optimized to capture semantic relations through self-alignment pretraining.
Text Embedding English
cambridgeltl
1.7M
49
Gte Large
MIT
GTE-Large is a powerful sentence transformer model focused on sentence similarity and text embedding tasks, excelling in multiple benchmark tests.
Text Embedding English
thenlper
1.5M
278
Gte Base En V1.5
Apache-2.0
GTE-base-en-v1.5 is an English sentence transformer model focused on sentence similarity tasks, excelling in multiple text embedding benchmarks.
Text Embedding Transformers English
Alibaba-NLP
1.5M
63
Gte Multilingual Base
Apache-2.0
GTE Multilingual Base is a multilingual sentence embedding model supporting over 50 languages, suitable for tasks like sentence similarity calculation.
Text Embedding Transformers Supports Multiple Languages
Alibaba-NLP
1.2M
246
Polybert
polyBERT is a chemical language model designed to achieve fully machine-driven ultrafast polymer informatics. It maps PSMILES strings into 600-dimensional dense fingerprints to numerically represent polymer chemical structures.
Text Embedding Transformers
kuelumbus
1.0M
5
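Since polyBERT exposes its fingerprints through a sentence-embedding interface, a minimal sketch of mapping PSMILES strings to dense polymer fingerprints could look like the following; the model id kuelumbus/polyBERT and the example PSMILES strings are assumptions based on this listing.

```python
# Minimal sketch: map PSMILES repeat-unit strings to dense polymer fingerprints.
# Model id and example PSMILES strings are assumptions, not verified here.
from sentence_transformers import SentenceTransformer

polybert = SentenceTransformer("kuelumbus/polyBERT")

psmiles = ["[*]CC[*]", "[*]COC[*]"]      # e.g. polyethylene-like and polyoxymethylene-like units
fingerprints = polybert.encode(psmiles)
print(fingerprints.shape)                 # expected: (2, 600)
```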
Bert Base Turkish Cased Mean Nli Stsb Tr
Apache-2.0
A sentence embedding model based on Turkish BERT, optimized for semantic similarity tasks
Text Embedding Transformers Other
emrecan
1.0M
40
GIST Small Embedding V0
MIT
A text embedding model fine-tuned from BAAI/bge-small-en-v1.5 on the MEDI dataset and MTEB classification task datasets, optimized for query encoding in retrieval tasks.
Text Embedding English
avsolatorio
945.68k
29
Gte Large En V1.5
Apache-2.0
GTE-large-en-v1.5 is a high-performance English text embedding model that excels in multiple text similarity and classification tasks.
Text Embedding Transformers English
Alibaba-NLP
891.76k
213
Snowflake Arctic Embed M
Apache-2.0
Snowflake Arctic Embed M is a sentence transformer model focused on sentence similarity tasks, capable of efficiently extracting text features and calculating similarity between sentences.
Text Embedding Transformers
Snowflake
722.08k
154
Splade Cocondenser Ensembledistil
A SPLADE model for passage retrieval that improves sparse neural information retrieval through knowledge distillation.
Text Embedding Transformers English
naver
606.73k
42
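SPLADE represents text as a sparse vector over the vocabulary rather than a dense embedding; the usual formulation max-pools log(1 + ReLU(masked-LM logits)) over token positions. The following is a minimal sketch under that assumption, with the model id taken from this listing rather than verified.

```python
# Minimal sketch of SPLADE-style sparse encoding with the Hugging Face transformers API.
# Term weights: max over token positions of log(1 + ReLU(masked-LM logits)).
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

model_id = "naver/splade-cocondenser-ensembledistil"   # assumed id from the listing
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)

inputs = tokenizer("sparse neural information retrieval", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits                    # (1, seq_len, vocab_size)

weights = torch.log1p(torch.relu(logits))
weights = weights * inputs["attention_mask"].unsqueeze(-1)   # ignore padding positions
sparse_vec = weights.max(dim=1).values.squeeze(0)            # one weight per vocabulary term

# Inspect the highest-weighted vocabulary terms.
top = torch.topk(sparse_vec, k=10)
print([(tokenizer.decode([int(i)]), round(v.item(), 2)) for i, v in zip(top.indices, top.values)])
```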
Text2vec Base Chinese
Apache-2.0
A Chinese text embedding model based on the CoSENT (Cosine Sentence) model, which can map sentences to a 768-dimensional dense vector space and is suitable for tasks such as sentence embedding, text matching, or semantic search.
Text Embedding Chinese
shibing624
605.98k
718
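A minimal sketch of using such a model for Chinese semantic matching via sentence-transformers; the model id shibing624/text2vec-base-chinese and the example sentences are assumptions based on this listing.

```python
# Minimal sketch: embed two Chinese sentences and compare them by cosine similarity.
# The model id is an assumption based on the listing above.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("shibing624/text2vec-base-chinese")
embeddings = model.encode(["如何更换花呗绑定银行卡", "花呗更改绑定银行卡"])
print(util.cos_sim(embeddings[0], embeddings[1]))   # cosine similarity of the sentence pair
```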
Rubert Tiny2
MIT
A compact BERT-based Russian encoder capable of generating high-quality sentence embeddings
Text Embedding Transformers Other
cointegrated
585.48k
135
Ms Marco MiniLM L2 V2
Apache-2.0
A cross-encoder model trained on the MS Marco passage ranking task for query-passage relevance scoring in information retrieval.
Text Embedding English
cross-encoder
533.42k
11
Ruri Base
Apache-2.0
Ruri is a general-purpose Japanese text embedding model, focused on sentence similarity and feature extraction tasks.
Text Embedding Japanese
cl-nagoya
523.56k
9
KR SBERT V40K Kluenli Augsts
This is a Korean sentence embedding model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Text Embedding Transformers Korean
snunlp
500.73k
61
Gte Small
MIT
GTE-small is a general text embedding model trained by Alibaba DAMO Academy, based on the BERT framework, suitable for tasks such as information retrieval and semantic text similarity.
Text Embedding Transformers English
Supabase
481.27k
89
Ms Marco MiniLM L12 V2
Apache-2.0
A cross-encoder model trained on the MS Marco passage ranking task for relevance ranking in information retrieval.
Text Embedding English
cross-encoder
469.35k
71
All Minilm L6 V2 With Attentions
Apache-2.0
This is an ONNX port of sentence-transformers/all-MiniLM-L6-v2, adjusted to return attention weights, specifically designed for BM42 search scenarios.
Text Embedding Transformers English
Qdrant
450.93k
10
Gte Small
MIT
GTE-small is a compact general-purpose text embedding model suitable for various natural language processing tasks, including sentence similarity calculation, text classification, and retrieval.
Text Embedding English
thenlper
450.86k
158
Sbert Large Nlu Ru
MIT
This is a large Russian language model based on the BERT architecture, specifically designed for generating sentence embeddings with case-insensitive processing support.
Text Embedding Transformers Other
ai-forever
386.96k
84
Labse En Ru
A streamlined version of the LaBSE model specialized for English and Russian, significantly reducing model size while preserving original embedding quality
Text Embedding Transformers Supports Multiple Languages
cointegrated
375.34k
51
Sentence Similarity Spanish Es
Apache-2.0
This is a Spanish sentence similarity calculation model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional vector space.
Text Embedding Transformers Spanish
hiiamsid
349.51k
48
Roberta Base Bne Finetuned Msmarco Qa Es Mnrl Mn
Apache-2.0
This is a Spanish-based sentence-transformers model specifically designed for question-answering scenarios, capable of mapping sentences and paragraphs into a 768-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding Spanish
dariolopez
347.38k
5
USER Bge M3
Apache-2.0
A universal sentence encoder for Russian, built on the sentence-transformers framework and designed to extract 1024-dimensional dense vectors from Russian text.
Text Embedding Other
deepvk
339.46k
58
Bge Small En V1.5 Onnx Q
Apache-2.0
Quantized ONNX version of the BAAI/bge-small-en-v1.5 model for text classification and similarity search.
Text Embedding Transformers
Qdrant
329.03k
1
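Quantized ONNX exports like this are aimed at lightweight CPU inference, for example through the fastembed library. A minimal sketch follows; the fastembed model name is an assumption, and fastembed resolves it to its own ONNX artifact rather than loading this repository directly.

```python
# Minimal sketch: CPU-side embedding with fastembed, which runs ONNX exports of BGE models.
# The model name is an assumption; fastembed picks the ONNX artifact it ships with.
from fastembed import TextEmbedding

model = TextEmbedding(model_name="BAAI/bge-small-en-v1.5")
docs = ["ONNX quantization shrinks the model", "embeddings for similarity search"]
embeddings = list(model.embed(docs))     # list of 384-dimensional numpy arrays
print(len(embeddings), embeddings[0].shape)
```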
Gte Base
MIT
GTE-Base is a general-purpose text embedding model focused on sentence similarity and text retrieval tasks, performing well on multiple benchmarks.
Text Embedding English
thenlper
317.05k
117
Bge M3 Onnx O4
MIT
This is the ONNX quantized version of the BAAI/bge-m3 model, supporting three functionalities: dense retrieval, multi-vector retrieval, and sparse retrieval, covering over 100 languages.
Text Embedding Transformers
hooman650
285.96k
10
Sup Simcse Roberta Large
Supervised SimCSE model based on RoBERTa-large for sentence embedding and feature extraction tasks.
Text Embedding
princeton-nlp
276.47k
25
GIST Embedding V0
MIT
GIST-Embedding-v0 is a sentence embedding model based on sentence-transformers, mainly used for sentence similarity calculation and feature extraction tasks.
Text Embedding English
avsolatorio
252.21k
26
Bge Micro V2
MIT
BGE Micro v2 is a lightweight model focused on sentence similarity calculation, suitable for various natural language processing tasks.
Text Embedding Transformers
TaylorAI
248.53k
46
Ms Marco TinyBERT L2 V2
Apache-2.0
A lightweight cross-encoder trained on the MS Marco passage ranking task for query-passage relevance scoring in information retrieval
Text Embedding English
cross-encoder
247.59k
25
Sapbert From PubMedBERT Fulltext Mean Token
A biomedical entity representation model based on PubMedBERT, optimized to capture semantic relations through self-alignment pretraining.
Text Embedding
cambridgeltl
244.39k
0
Nomic Embed Text V2 Moe
Apache-2.0
Nomic Embed v2 is a high-performance multilingual Mixture of Experts (MoE) text embedding model supporting approximately 100 languages, excelling in multilingual retrieval tasks.
Text Embedding Supports Multiple Languages
nomic-ai
242.32k
357
Gte Qwen2 1.5B Instruct
Apache-2.0
A general-purpose text embedding model based on Qwen2-1.5B, supporting multilingual and long-text processing
Text Embedding Transformers
Alibaba-NLP
242.12k
207
Gte Multilingual Reranker Base
Apache-2.0
The first multilingual reranking model in the GTE series, supporting 70+ languages with high performance and long text processing capabilities.
Text Embedding Transformers Supports Multiple Languages
Alibaba-NLP
239.91k
122
Amber Large
Apache-2.0
A Japanese-English bilingual sentence feature extraction model based on modernbert-ja-310m, supporting sentence similarity computation and text classification tasks
Text Embedding Supports Multiple Languages
retrieva-jp
239.28k
7
Mmlw Retrieval Roberta Large
Apache-2.0
MMLW (I Must Get Better Messages) is a neural text encoder for Polish, optimized for information retrieval tasks.
Text Embedding Transformers Other
sdadas
237.90k
12
Ms Marco MiniLM L4 V2
Apache-2.0
A cross-encoder model trained on the MS Marco passage ranking task for scoring query-passage relevance in information retrieval
Text Embedding English
cross-encoder
234.18k
10
Snowflake Arctic Embed L V2.0
Apache-2.0
Snowflake Arctic Embed v2.0 is a multilingual sentence embedding model that supports text feature extraction and sentence similarity calculation for over 100 languages.
Text Embedding Transformers Supports Multiple Languages
Snowflake
231.00k
156